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ABSTRACT 

We show that a large-area imaging survey using narrow-band filters could detect 
quasars in sufficiently high number densities, and with more than sufficient accu- 
racy in their photometric redshifts, to turn them into suitable tracers of large-scale 
structure. If a narrow-band optical survey can detect objects as faint as i = 23, it 
could reach volumetric number densities as high as ft. 3 Mpc~ 3 (comoving) at 
z ~ 1.5. Such a catalog would lead to precision measurements of the power spectrum 
up to z ~ 3 — 4. We also show that it is possible to employ quasars to measure baryon 
acoustic oscillations at high redshifts, where the uncertainties from redshift distortions 
and nonlinearities are much smaller than at z < 1. As a concrete example we study 
the future impact of J-PAS, which is a narrow-band imaging survey in the optical over 
1/5 of the unobscured sky with 42 filters of ~ 100 A full- width at half-maximum. We 
show that J-PAS will be able to take advantage of the broad emission lines of quasars 
to deliver excellent photometric redshifts, o z ~ 0.002 (1 + z), for millions of objects. 

Key words: quasars: general - large-scale structure of Universe 



1 INTRODUCTION 

Quasars are among the most luminous objects in the Uni- 
verse. They are believed to be powered by the accretion 
disks of giant b l ack holes that lie at the cent e rs of galaxies 
ISalpeterl (fl964h ; IZel'Dovich fc Novikovl (| 19651 ) ; iLvnden-Belll 
(|l969l )]. and the extreme environments of those disks are re- 
sponsible for emitting the "non-stellar continuum" and the 
broad emission lines that characterize the spectral energy 
distributions (SEDs) of quasars and most other types of Ac- 
tive Galactic Nuclei (AGNs). 

However, even though all galaxy bulges in the local Uni- 
verse seem to host supermassiv e black holes in their centers 
|Kormendv fc Richstoiiej (|l995h ], the duty c ycle of quasars is 
much smaller than the age of the Universe [Richstone et al.1 
(|l998l )]. This means that at any given time the number den- 
sity of quasars is small compared to that of galaxies. 

As a consequence, galaxies have been the preferred 



tracers of large-scale structure in the Universe: their high 
densities and relatively high luminosities allow astronomers 
to compile large samples, di stributed across v a st volumes. 
Both spectroscopic [see, e.g., ICole et all (|2005K ;I York et al.1 
(|2000l ): BO S^Bl and broad-ban d (e .g., ugriz) photomet- 
ric surv eys (Scoville et al.1 (|2007f ) ; lAdelman-McCarthv et all 
(|2008al lbf)] have been used with remarkable success to study 
the distribution of galaxies, particularly so for the subset 
of luminous red galaxies (LRGs), for which it is possible 
to obtain relatively good photometric redshifts (photo-z's) 
[a z ~ 0.01 (1 + z)] e v en wi t h broad- b and f ilt er photom- 
etry jBolzonella et all d200Ch ; iBemtezI J2000h; iFirth et al 



(|2003f): IPadmanabhan et al. I (|2005l . 120071 ): I Abdalla et al 
l2008d lblkTi]~ From a purely statistical perspective, photo- 
metric surveys have the advantage of larger volumes and 
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densities than spectroscopic surveys - albeit with dimin- 
ished spatial resolution in the radial direction, which can be 
a limiting factor for some ap plications, in particula r baryon 
acoustic oscillations (BAOs) [Blake fc Bridle! (120051 1]. 

Most ongoing wide-area su rveys choose one of t he par- 
allel strat egies o f imag ing [e.g.. lAbbott et all (|2005l ); PAN- 
STARRS ^Abelll <|2009l )] or multi-object spectroscopy (e.g., 
BOSS), and future instruments will probably continue fol- 
lowing these trends, since spectroscopic surveys need wide, 
deep imaging for target selection, and imaging surveys need 
large spectroscopic samples as calibration sets. 

However, whereas LRGs possess a signature spectral 
feature (the so-called A r0 st ~ 4,000 A break), which trans- 
lates into fairly good photo-z's with ugriz imaging, the SEDs 
of quasars observed by broad-band filters only show a similar 
feature (the Ly-a line) at A ros t ~ 1,200 A, which makes them 
UV-dropout objects. The segregation of quasars from stars 
and unresolved galaxies in color-color and color- magnitude 
diagrams has allowed the construction of a high-purity cat- 
alog of ~ 1.2 x 10 6 photome trically selected quasars in the 
SDSS (Richards etahl i[2008l )], and the (broad-band) pho- 
tometric redshifts of z < 2.2 objects in that catalog can 
be estimated by th e passage of the emiss ion lines from one 
filter to the next Richards et all |200ll )]. More recently, 
ISalvato et al ] (|2009f r showed that a combination of broad- 
band and medium-band filters reduced the photo-z errors 
of the XMM-COSMOS sources down to cr 2 /(l + z) ~ 0.01 
(median). 

The SPSS spec tr oscopic catalog of quasar s 
[Schneider etafl (|2003h ; ISchneider etUT d2007l, Wchl 



is five times bigger than previous samples Croom et alj 
(|2004h ], but includes only ~ 10% of the total number of 
good candidates in the photometric sample. Furthermore, 
that catalog is limited to relatively bright objects, with 
apparent magnitudes i < 19.1 at z < 3.0, and i < 20.2 
for objects with z > 3.0. Despite the sparseness of the 
SDSS spectroscopic catalog of quasars (the comoving 
number density of objects in that catalog peaks at < 10~ 6 
Mpc -3 around z ~ 1), it has been successfully employed 
in s everal measurements of large-scale structure - see, 
e.g., iPorciani et~ai1 (12004); Ida Angela et all (120051 ). which 
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used the 2QZ survey [Croom et all (|2005l . l2009bl )] for the 



first modern applications of quasars in a cosm ological 
context; see IShen et~ai1 (|2007l ); IRoss et alj (|2009T ) for the 
cosmological impact of t he SDSS quasar survey; and 
iPadmanabhan et al.l (|2008l ). which cross-correlated quasars 
with the SDSS photometric catalog of LRGs. One can also 
use quasars as a backlight to illuminate the intervening 
distribution of neutral H, which ca n then be u sed t o 
compute the mas s power spectrum [Croft et al.l ([19981 ); 
ISeliak et alj l[2005l )]. 

The broad e missio n lines of type-I quasars 
[Vanden Berk et all (|200ll )]. which are a manifestation 
of the extremely high velocities of the gas in the envi- 
ronments of supermassive black holes, are ideal features 
with which to obtain photo-z's, if only the filters were 
narr ow enough ( AA < 4 00 A) to capture those features. In 
fact, IWolf et all |2003bl ) have produced a sample of a few 



hundred quasars using a combination of broad-band and 
narrow-band filters (the COMBO-17 survey), obtaining a 
photo-z accuracy of approximately 3% - basicall y the same 
accurac y that was for their catalog of galaxies Wolf et ahl 
|2003al )]. Acquiring a sufficiently large number of quasars 
in an existing narrow-band galaxy survey would be both 
feasible and it would bear zero marginal cost on the survey 
budgets. 

Fortunately, a range of science cases that hinge on large 
volumes and good spectral resolution, in par ticular galaxy 
survey s with the goal of mea s uring B AOs Peebles fc Yu 




Zeldovich] (ll970T); iBond fc Efstathiou 
n| (Il989l); iHu fc Sugivamal (Il995l)l. both 
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in th e a ngular lEisenstein et alj (120051); iTegmark et al 



ip006l) ; 
(12007 



iBlake et al. 



(120071) ; 



IPadmanabhan et al. 



iPercival et"al~1 (l2007f)l and in the radial di 



rections lEisenstein fc Hul ( 19991); Eisenstein~l 
Blake fc Glazebrookl (120031); ISeo fc Eisensteinl 



2003); 



2003): 



Angulo et aU (|2008l ); ISeo fc Eisensteinl (|2007h ] has stim- 
ulated astronomers to construct new instruments. They 
should be not only capable of detecting huge numbers of 
galaxies, but also of measuring much more precisely the 
photometric redshifts for these galaxies - and that means 
either low-resolution spectroscopy, or filters narrower than 
the ugriz system. 

Presently there are a few instruments which can be 
characterized either as narrow-band imaging surveys, or low- 
resoluti on multi-obj e ct spe ctroscopy su rveys : the A lhambra 
survey [Moles et alj (|2008D j. PRIMUS [Cooll (|200Sl )l, HET- 



DEXQ and the PAU survejjj. The Alhambra survey uses the 
LAICA camera on the 3.5 m Calar Alto telescope, and is 
mapping 4 deg 2 between 3,500 A and 9,700 A, using a set 
of 20 filters equally spaced in the optical plus JHK broad 
filters in the NIR. PRIMUS takes low-resolution spectra of 
selected objects with a prism and slit mask built for the 
IMACS instrument at the 6.5 m Magellan/Baade telescope. 
PRIMUS has already mapped ~ 10 deg 2 of the sky down 
to a depth of 23.5, and has extracted redshifts of ~ 3 x 10 5 
galaxies up to z = 1, with a photo-z accuracy of order 1% 
[Coil et alj (|2010l )]. HETDEX is a large-field of view, integral 
field unit spectrograph to be mounted on the 10 m Hobby- 
Eberly telescope that will map 420 deg 2 with filling factor 
of 1/7 and an effective spectral resolution of 6.4 A between 
3,500 and 5,500 A. The PAU survey will use 40 narrow-band 
filters and five broadband filters mounted on a new camera 
on the William Herschel Telescope to observe 100-200 deg 2 
down to a magnitude iab ~ 23. All these surveys will detect 
large numbers of intermediate- to high-redshift objects (in- 
cluding AGNs), and by their nature will provide very dense, 
extremely complete datasets. 

Another instrument which plans to make a wide- 
area spectrophotometric map of the sky is the Javalambre 
Physics of the Accelerating Universe Astrophysi cal Survey 
(J-PAS). The instrument [see lBemtez et alj (|2009l )] will con- 
sist of two telescopes, of 2.5 m (T250) and 0.8 m (T80) aper- 
tures, which are bein g built at Sier r a de J avalambre, in main- 
land Spain (40° N) [Moles et al l (|2010l )]. A dedicated 1.2 
Gpixel survey camera with a field of view of 7 deg 2 (5 deg 2 



2 Pan-STARRS technical 
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Figure 1. Throughputs of the original J-PAS filter system, as- 
suming an airmass of 1.2, two aluminum reflections and the quan- 
tum efficiency of the LBNL CCDs (N. Bem'tez, private communi- 
cation). The 42 narrow-band filters are spaced by 93 A, with 118 
A FWHM, and span the interval between 4,250 A and 8,200 A. 
The final filter system for J-PAS is still under review, and may 
present small deviations from the original filter set of Bem'itez et 
al. (2009) - see Bem'itez et al (2011), to appear. We have checked 
that the results presented in this paper are basically insensitive 
to these small variations. 



effective) will be mounted on the focal plane of the T250 
telescope, while the T80 telescope will be used mainly for 
photometric calibration. The survey (which is fully funded 
through a Spain-Brazil collaboration) is planned to take 4-5 
years and is expected to map between 8,000 and 9,000 deg 2 
to a 5<r magnitude depth for point sources equivalent to 
zab ~ 23 (i ~ 23.3) over an aperture of 2 arcsec 2 . The filter 
system of the J-PA S instrument, as originally described in 
iBemtez et ail (|2009l ). consists of 42 contiguous narrow-band 
filters of 118 A FWHM spanning the range from 4,300 A 
to 8,150 A - see Fig. [T] This set of filters was designed to 
extract photo-z's of LRGs with (rms) accuracy as good as 
u z ~ 0.003 (1 + z). Of course, this filter configuration is also 
ideal to detect and extract photo-z's of type-I quasars - see 

Fig. m 

In this paper we show that a narrow-band imaging sur- 
vey such as J-PAS will detect quasars in sufficiently high 
numbers (~ 2. x 10 6 up to z ~ 5), and with more than suffi- 
cient redshift accuracy, to make precision measurements of 
the power spectrum. In particular, these observations will 
yield a high-redshift measurement of BAOs, at an epoch 
where redshift distortions and nonlinearities are much less 
of a nuisance than in the local Universe. This huge dataset 
may also allow precision measuremen ts of the quasar lumi- 
nosity function [Hopkins et a l. (2007)], clustering and bias 
[Shen et all (|2007| ); IRoss et all |2009l ); IShen et~all j201Cf)l, as 
well a s limits on the quasar duty cycle [Martini fc Weinberg 
(III])]. 

This paper is organized as follows: In Section II we show 
how narrow (~ 100 A bandwidth) filters can be used to ex- 
tract redshifts of quasars with high efficiency and accuracy. 
We compare two photo-z methods: empirical template fit- 
ting, and the training set method. Still in Section II, we 
study the issues of completeness and contamination. In Sec- 
tion III we compute the expected number of quasars in a 
flux-limited narrow-band imaging survey, and derive the un- 
certainties in the power spectrum that can be achieved with 
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Figure 2. Three SDSS quasars as they would be observed by 
the filter system of Fig. [T] The SDSS objects are, from top to 
bottom: J000143. 41-152021. 4 (z = 2.638), J001138.43-104458.2 
(at z = 1.271), and J002019.22-110609.2 (z = 0.492). The light 
(blue in color version) curve indicates the flux (in units of 10 — 17 
erg/s/cm 2 /A) in spectral bins of the original SDSS spectra; the 
large (red) dots denote the corresponding fluxes (normalized by 
the filter throughput) for the J-PAS narrow-band filters. Some 
emission lines can be seen in the photometric data: Ly-ct, Si IV, 
C IV and C III] for the spectrum on top; C III] and Mg II for 
the quasar in the central panel; and Mg II, H7 and H/3 (together 
with the [O III] doublet) for the spectrum on the bottom. 



that catalog. Our fiducial cosmological model is a flat ACDM 
Universe with h = 0.72 and Q m = 0.25, and all distances 
are comoving, unless explicitly noted. 

As we were finalizin g this work, a closely related 
preprint, ISawangwit et all |201ll ). came to our notice. In 
that paper the authors analyze the SDSS, 2QZ and 2SLAQ 
quasar catalogs in search of the BAO features - see also 
lYahata et all (|2005h for a previous attempt using only the 
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SDSS quasars. Although Sawangwit et al. are unable to 
make a detection of BAOs with these combined catalogs, 
they have forecast that a spectroscopic survey with a quar- 
ter million quasars over 2000 deg 2 would be sufficient to 
detect the scale of BAOs with accuracy comparable to that 
presently made by LRGs - but at a higher redshift. Their 
conclusions are consistent with what we have found in Sec- 
tion 111 of this paper. 



2 PHOTOMETRIC REDSHIFTS OF QUASARS 

The idea of using the fluxes observed through multiple fil- 
ters, instead of full-fledged spectra, to estimate the red- 
shifts of astr onomical objects, is almost five decades old 
|Bauml (|l962T )]. but only recently it has acquired greater 
relevance in co n nectio n with photometri c galax y surveys 



Connolly et all d 19951) 
'2000l);lBlake fc Bridle 



2009)]. In fact, 



Bolz onella et al 
20051 ) 



iFirth et al 



ly plan ned astrophysical surveys such 
as PE S lAbbott et all (|2005l )]. Pan-STARRS and the LSST 
|Abelll (|2009l )] are relying (or plan to rely) almost entirely 
on photometric redshifts (photo-z's) of galaxies for the bulk 
of their science cases. 

Photometric redshift methods can be divided into two 
basic groups: empirical template fitting methods , and train- 
ing set methods - see, however, iBudavaril (|2009l ) for a uni- 
fying scheme. With template-based m ethods [which may 
includ e spectral synthesis methods, e.g. iBruzual fc Charlotl 
(|2003l )] the photometric fluxes are fitted (typically through 
a x 2 ) to some model, or template, which has been properly 
redshifted, and the photometric redshift (photo-z) is given by 
a maximum likelihood estimator (MLE) . In the training set 
approach, a large number of spectra is used to empirically 
calibrate a multidimensional mapping between photometric 
fluxes and redshifts, without explicit modeling templates. 

The performance of template fitting methods and of 
training set methods are similar when they are a pplied to 
broad- band photometric surveys [Budavaril (|2009l )]. In this 
paper we have taken both approaches, in order to compare 
their performances specifically for the case of a narrow-band 
filter surveys of quasars. 



2.1 The spectroscopic sample of quasars 

We have randomly selected a sample of 10,0 00 quasars from 
the compilation of ISchneider et al. 1 |2010l ) of all spectro- 
scopically confirmed SDSS quasars, that lie in the Northern 
Galactic Cap, that have an i-band magnitude brighter than 
20.4, and that have low Galactic ex tinction, as determined 
by the maps of lSchlegel et al.l (|1998l ). Avoiding the Southern 
Galactic Cap means that the sample does not contain the 
various "special" samples of quasars targeted on the Celesti al 
Equator in the Fall sky |Adelman-McCarthv et all (|200rj )]. 
which tend to be more unusual, fainter, or less representative 
of the quasar population as a whole. The magnitude limit 
also removes those objects at lowest signal-to-noise ratio. In- 
deed, the vast majority of the 10 4 objects are se l ected using 
the uniform criteria described bv lRichards et ah! (|2002l ). The 
SEDs of these objects were measured in the interval 3,793 
k < X < 9,221 A, with a spectr al resolution of R ~ 2,000 
and accurate spectrophotometry [Adelman-McCarthv et all 



|2008bl )]. The number of quasars as a function of redshift in 
our sample is shown in the left (red in color version) bars of 
Fig. [3] and reproduces the redshift distribution of the SDSS 
quasar catalog as a whole rather well. 

Starting from the spectra of our sample, we constructed 
synthetic fluxes using the 42 transmission functions shown 
in Fig. [1] The reduction is straightforward: the flux is ob- 
tained by the convolution of the SDSS spectra with the filter 
transmission functions: 



— / T a (X)S p (X)dX 



fa(p) 



where f a ( P ) is the flux of the object p measured in the 
narrow-band filter a, T a is the transmission function of the 
filter a, n a — J T a (X)dX is the total transmission normal- 
ization, and S p is the SED of the object. The noise in each 
filter in obtained by adding the noise in each spectral bin in 
quadrature. 



2.2 Simulated sample of quasars 

The procedure outlined above generates fluxes with errors 
which are totally unrelated to the errors we expect in a 
narrow-band filter survey. The magnitude depths (and the 
signal-to-noise ratios) of the original SDSS sample are char- 
acteristics of that instrument, and corresponds to objects 
with i < 19.1 for z < 3.0, and i < 20.2 for z > 3. However, 
we want to determine the accuracy of photo-z methods for 
a narrow-band survey that reaches i ~ 23. Hence, we need a 
sample which includes, on average, much less luminous ob- 
jects than the SDSS catalog does. It is easy to construct an 
approximately fair sample of faint objects from a fair sample 
of bright objects, as long as the SEDs of these objects do 
not depend strongly on their luminosit ies - which seems to 
be the case for quasars [Baldwin! |l977f )]. 

We have used our original sample of 10,000 SDSS 
quasars described in the previous Section to construct a 
simulated sample of quasars. For each object in the origi- 
nal sample with a magnitude i we associate an object in the 
simulated sample of magnitude i„, given by: 



= 14 + 1.5(i - 14) 



(1) 



Since the original sample had objects with magnitudes i ~ 
14 — 20.5, the simulated sample has objects ranging from 
i 3 ~ 14 to i s ~ 23.5. The distribution of quasars as a func- 
tion of their magnitudes, in the original and in the sim- 
ulated samples, are shown in Fig. [4] Clearly, Eq. JTJ still 
reproduces the selection criteria of the original SDSS sam- 
ple, which is evidenced by the step-like features of the his- 
tograms shown in Fig. [4] However, in this Section we are 
not as concerned with the number of quasars as a function 
of redshift and magnitude (which we believe are well repre- 
sented by the luminosity function that was employed in the 
previous Section), but with the accuracy of the photomet- 
ric redshifts and the fraction of catastrophic outliers - i.e., 
the instances when the photometric redshifts deviate from 
the spectroscopic redshifts by more than a given threshold. 
While we have not detected any significant correlations be- 
tween the absolute or relative magnitudes and the accuracy 
of the photo-z's, we have found that the number of photo-z 
outliers is higher for the simulated sample, compared with 
the original sample, which means that the rate of outliers 
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Figure 3. Redshift distribution of our full sample of quasars, in terms of their spectroscopic redshifts z s (left bars, red in color version) 
and their photometric redshifts z v obtained through the template fitting method of Section 12.31 (right bars, blue in color version), in bins 
of Az = 0.25. Left panel: sub-sample of SDSS quasars; right panel: simulated sample of fainter objects. 
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Figure 4. Distribution of magnitudes of the objects in our orig- 
inal sample (light bars) and in the simulated sample (dark bars.) 




Figure 5. Estimated limiting magnitudes (5c) for J-PAS with 
an aperture of 2", assuming a read-out noise of 5e/pixel. 



does depend to some extent on the actual magnitudes of the 
sample. This is discussed in detail in Section [2. 31 

In order to generate realistic signal-to-noise ratios 
(SNR) for the objects in this simulated sample, we also need 
to specify the depths of the survey that we are considering, 
in each one of its 42 niters. The 5a magnitude limits that 
we have estimated for J-PAS, considering the size of the 
telescope, an aperture of 2 arcsec, the median seeing at the 
site, the total exposure times for an 8,000 deg 2 survey over 4 
years, the presumed read-out noise, filter throughputs, night 
sky luminosity, lunar cycle, etc., are shown in Fig. [5] 

Our model for the signal-to-noise ratio (SNR) in each 
filter, for simulated quasars of a given i-band magnitude i s , 
is the following: 



SNR(a) = 5^W 0AWa '>-^ 

fi 



(2) 



where fi is the average flux of that object in the 10 filters 
(7,100 < A < 8,100) that overlap with the i-band; f(a) is 
the flux in filter a; d(a) is the 5a depth of filter a from Fig. 
and i s is the (simulated) i-band magnitude of that object. 
This model assumes that the intrinsic photon counting noise 
of the quasar is subdominant compared to other sources of 
noise such as the sky or the host galaxy. In order to obtain 



the desired SNR in our simulated sample, we have added a 
white (Gaussian) noise to the fluxes of the original sample, 
such that the final level of noise is the one prescribed by Eq. 
©■ 



2.3 Photometric redshifts of quasars: Template 
Fitting Method 

Conceptually, fitting a series of photometric fluxes to a tem- 
plate is the simplest method to obta in redshi f ts from objects 
that belong to a given spectral class [Ben itcz (200(3)]. Type-I 
quasars possess a (double) power-law continuum that rises 
rapidly in the blue, and a series of broad (AA/A ~ 1/20 
- 1/10 FWHM) emission lines - see Fig. [2] At high red- 
shifts (z > 2.5) the Ly-a break (which is a sharp drop in 
the observed spectrum of distant quasars due to absorp- 
tion from intervening neutral Hydrogen) can be seen at 
A > 4,000 A, which lies just within the dynamic range of 
the filter system we are exploring here. These very distinct 
spectral features, which are clearly resolved with our filter 
system, allow not only the extraction of excellent photo- 
z's, but can also be used to distinguish quasars from stars 
unambiguously - see, e.g., the SDSS spectral templates, 
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lAdelman-McCarthv et"all J2008eJ ) . The COMBO-17 quasar 
catalog [Wolf et all (|2003bT )]~ has successfully employed a 
template fitting method not only to obtain photometric red- 
shifts, but also to identify stars and understand the com- 
pleteness and rate of contamination of the quasar sample. 

Here we will assume that all quasars have already been 
identified, and the only parameter that we will fit in our tests 
is the redshift of a given object. A more detailed analysis 
will be the subject of a forthcoming publication (Gonzalez- 
Serrano et al., 2012, to appear). 

Our baseline mo del for the quasar sp e ctra i s the Vanden 
Berk mean template (Vanden Berk et al.l l)200ll )]. which also 
includes the uncertainties due to intrinsic variations. We al- 
low for further variability in the quasar spectra by mean s 
of the global eigen-spectra computed by lYip et aD l|2004F ) . 
We use both the uncertainties in the Vanden Berk template 
and the Yip et al. eigen-spectra because they capture dif- 
ferent types of intrinsic variability: while the uncertainties 
in the template are more suited to allow for uncorrelated 
variations around the emission lines and below the Ly-a, 
the Yip et al. eigen-spectra allow for features such as con- 
tamination from the host galaxy (which is most relevant 
at low luminosities), UV-optical continuum variations, cor- 
related Balmer emission lines and other secondary effects 
such as broad absorption line systems. We search for the 
best-fit combination of the four eigen-spectra at each red- 
shift, by varying their weights (w PtZ , p = 1 • • • 4) in the 
interval — 3w p < w Pt z < 3w p , where w p is the weight of 
the p-th eigenvalue relative to the mean. The four highest- 
ranked global eigen-spectra have weights of wi = 0.119, 
W2 — 0.076, W3 = 0.066, and Wi — 0.028 relative to the 
mean template s pectrum (which has wo = 1 by definition) 
|Yip et al.l (|2004l )]. 

The eigen-spectra are included in the MLE in the fol- 
lowing way: first, we normalize the fluxes by their square- 
integral, i.e.: f a -> /" = /a/yXbLi fi 
number of filters (42 for J-PAS.) We then add the red- 
shifted eigen-spectra f P! a{z) to the average template [fo, a (z)] 
with weights w p (z), so that at each redshift we have /" = 
/oU + X^i w pfp,a- The weights w p are found by minimizing 
the (reduced) x 2 a t each redshift: 



2 , where N is the 



1 



[/" -/"wi 2 



(3) 



where /" (i) are the fluxes from some object i in our sample 
of SDSS quasars, and a 2 (i,z) is the sum in quadrature of 
the flux errors and of the (2-cr) uncertainties in the quasar 
template spectrum for that filter. We have not marginalized 
over the weights of the eigen-spectra - i.e., the method is 
indifferent as to whether or not the best fit to an object at a 
given redshift includes an unusually large contribution from 
some particular eigen-mode. 

It is also interesting to search for the linear combination 
between the fluxes that leads to the most accurate photo- 
z's. We could have employed either the fluxes themselves 
or the colors (flux differences) for the procedure that was 
outlined above - or, in fact, any linear co mbination of th e 
fluxes. Most photo-z methods employ col o rs iBemtezI (1200011 ; 
iBlake fc Bridie] (|2005l ): lFirth et all (|2003ft ; lBudavaril (|2009l ')]. 
since this seems to reduce the influence of some system- 
atic effects such as reddening, and it also eliminates the 



need to marginalize over the normalization of the observed 
flux. We have tested the performance of the template fit- 
ting method using the fluxes f a , the colors Af a — fa — fa-i 
(the derivative of the flux), and also the second differences 
A 2 f a = fa+i — 2/ a + f a -i (the second derivative of the 
flux, or color differences.) We have noticed a slightly better 
performance with the latter choice (A 2 f a ) when compared 
with the usual colors (A/ a ), but the difference is negligible 
and therefore in this work we have kept the usual practice 
of using colors. The results shown in the remainder of this 
Section refer to the traditional template-fitting method with 
colors. 

In Fig. [6] we plot the distribution of log 10 \ 2 (f° r the 
best-fit \ 2 among all z's) for our sample of 10 4 quasars. The 
wide variation in the quality of the fit is partly due to the 
small number of free parameters: we fit only the redshift and 
the weights of the four eigen-modes. 

Once the X 2 ( z ) has been determined for a given object, 
we build the corresponding posterior probability distribution 
function (p.d.f.): 



I \ -X 2 0)/2 

p(z) oc e *- v 



(4) 



The photometric redshift is the one that minimizes the x 
(the MLE.) 

Finally, we need to estimate the "odds" that the photo- 
z of a given object is accurate. Due to the many possi- 
ble mismatches between different combinations of the emis- 
sion lines, the p.d.f. 's are highly non-Gaussian, with multi- 
ple peaks (i.e., multi-modality.) Hence, we have employed 
an empirical set of indicators to assess the quality of the 
photo-z's. These empirical indicators are: (i) the value of 
the best-fit x 2 \ ( n ) the ratio between the posterior p.d.f. 
p(z) at the first (global) maximum of the p.d.f. and the 
value of the p.d.f. at the secondary maximum (if it ex- 
ists), r = p m ax#i/Pmax#2; and (iii) the dispersion of the 
p.d.f. around the best fit, a = f(z — zt e st) 2 p{z)dz. We 
then maximize the correlation between the redshift error 
\z p — z s |/(1 + z s ) and a linear combination of simple func- 
tions of these indicators. Finally, we normalize the results 
so that they lie between (a very bad fit) and 1 (very good 
fit.) 

For the original SDSS sample, we found empirically that 
the combination that correlates (positively) most strongly 
with the photo-z errors (the quality) is given by: 

(5) 

(6) 
1 with 



1.4<T 



q = 0.15 log(0.7 + xlf) + e 8(r_1) + 0.06 e 
For the simulated sample, the quality indicator is: 



q=0.31og(0.6 + x?/)+e 



15(r-l) 



+ 0.026 e" 



Finally, we compute the quality factor < q < 
the formula: 

<7 = 



(7) 

max(g) — min(g) 

where the power of 4 was introduced to produce a "flatter" 
distribution of bad and good fits (this step does not affect 
the photo-z quality cuts that we impose below). 

The relationship between the quality factor and the 
photometric redshift errors is shown in the distributions of 
Fig. [7] There is a strong correlation between the quality fac- 
tor and the rate of "catastrophic errors", which we define 
arbitrarily as any instance in which \z p — z s |/(1 + z s ) > 0.02 
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2000 r . 




Figure 6. Histogram of the best-fit reduced x 2 for the sample of 10 4 quasars from the SDSS spectroscopic catalog. Left panel: original 
SDSS sample limited at i < 20.1; right panel: simulated sample, effectively limited at i < 23.5. We point out that the distributions above 
are not at all typical of a x 2 probability distribution function - the horizontal axis is in fact log 10 \ 2 ■ 



- denoted as the horizontal dashed lines in Fig. We have 
adopted the usual convention of scaling the redshift errors 
by 1 + z, since this is the scaling of the rest-frame spec- 
tral features. There is no obvious reason why emission-line 
systems (whose salient features can enter or exit the filter 
system depending on the redshift) should also be subject 
to this scaling, but we have verified that the scatter in the 
non-catastrophic photo-z estimates do indeed scale approx- 
imately as 1 + z. 

We have divided our sample into four groups with an 
equal number of objects, according to the value of q: lowest 
quality (gi, 2,500 objects), medium-low quality (g2, 2,500 
objects), medium-high quality (33, 2,500 objects) and high- 
est quality (54, 2500 objects) photo-z's. These grade groups 
are separated by the vertical dotted lines shown in Fig. [Jj 
For the original sample, the rate of catastrophic redshifts is 
16.9 %, 0.08 %, % and % in the grade groups gi, g2, gz 
and gi, respectively. For the simulated sample, the rate of 
catastrophic errors is 44.7 %, 2.3 %, 0.001 % and % in the 
groups gi, (72, <?3 and (?4, respectively. 

The relationship between spectroscopic and photomet- 
ric redshifts is shown in Fig. [5] where each quadrant corre- 
sponds to a grade group. Almost all the catastrophic redshift 
errors are in the 171 grade group, and most of the catastrophic 
errors lie below z v < 2.5 - since it is above this redshift that 
the Ly-a break becomes visible in our filter system. 

From Figs. [7J and [8] it is clear that the rate of catas- 
trophic photo-z's is larger for the simulated sample, which 
has an overall fraction of approximately 12% of outliers, 
compared to the original sample, which has a total frac- 
tion of 4% of outliers. A similar increase happens also when 
the Training Set method is applied to these samples (see the 
next Section). Since the simulated sample used in this Sec- 
tion was not designed to reproduce the actual distribution of 
magnitudes expected in a real catalog of quasars, this means 
that our results for the rate of outliers are only an estimate 
for the actual rate that we should expect from the final J- 
PAS catalog. However, even as the rate of outliers increases 
from the original to the simulated samples, the accuracy of 
the photo-z's are still very nearly the same. This means that 
the actual distribution of magnitudes of an eventual J- PAS 
quasar catalog should have little impact on the accuracy of 



the photo-z's - although it could affect the completeness and 
purity of that catalog. 

A further peculiarity of the quasar photo-z's is evident 
in the lines z p = z* + az s , which are most prominent in 
the gi groups of the original and simulated samples, as well 
as the g2 group of the simulated sample. Whenever two (or 
more) pairs of broad emisson lines are separated by the same 
relative interval in wavelength, i.e. Aq/A^ ~ A 7 /Ai, (where 
X a ...s are the central wavelengths of the emission lines), there 
is an enhanced potential for a degeneracy of the fir between 
the data and the template - i.e., additional peaks appear in 
the p.d.f. p(z). As the true redshift of the quasar change, the 
ratios between these lines remain invariant, and so the ratios 
between the true and the false redshifts, (1 + zt rue )/(l + 
z false), also remain constant, giving rise to the lines seen in 
Fig. [H] The degeneracy is broken when additional emission 
lines come into the filter system, which explains why some 
redshifts are more susceptible to this problem. 

The median and median absolute deviation (mad) of the 
redshift errors in each grade groups are shown in Fig. [9] for 
the original (left panel) and simulated (right panel) samples. 
For the lowest quality photo-z's (grade group 1), the median 
for the original sample of quasars is med[\z p — z a \/(l + z 3 )] = 
0.0019, and the deviation is mad[\z p — z s \/(l + z s )] = 0.0014, 
which is very small given the high level of contamination 
from outliers - 12% for that group. For the simulated sample 
the redshift errors are much larger: the median and median 
deviation for group 1 are 0.0073 and 0.0069, respectively 
- which is not surprising given that the number of catas- 
trophic photo-z's is 44.7%. However, for the grade group 
2 the median and median deviation for the original sample 
falls to 0.001 and 0.0007, respectively. More importantly, for 
the simulated sample the median and deviation are 0.0014 
and 0.001, respectively. The accuracies of the photo-z's for 
the grade groups 3 and 4 are slightly higher still. 

An alternative metric to assess the accuracy of the 
photometric redshifts is to manage the sensitivity to catas- 
trophic outliers with the following method. First, we com- 
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Figure 7. 2D histograms of the photo-z errors log 10 \z p — z s \/(l + z s ) (vertical axis) and the quality factor q (horizontal axis). The left and 
right panels correspond to the original and the simulated samples, respectively. The catastrophic redshift errors [\z p — s s |/(1 + z s ) > 0.02] 
lie above the horizontal dashed (red in color version) line. The quality factor has been grouped into four "grades", from grade=l to 
grade=4, according to the vertical dashed (green in color version) lines. 
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Figure 8. Scatter-plots of spectroscopic redshifts (horizontal axis) versus photometric redshifts (vertical axis) obtained with the template 
fitting method, for the four quality grade groups (1, 2, 3 and 4). Left panel: original sample; right panel: simulated sample. There are 
2,500 objects in the group g± (first quadrant in the upper right corner, red dots in color version); 2,500 objects in the group 32 (second 
quadrant and green dots); 2,500 objects in the group 33 (third quadrant and blue dots); and 2,500 objects in the group 34 (fourth 
quadrant and black dots). The radial lines in the gi group correspond to degenerate regions of the z p — z a mapping. There are virtually 
no catastrophic errors for z v > 2.5 objects in the 321 93 and 34 grades in the simulated samples. 




Grade group Grade group 

Figure 9. Median (med) and median absolute deviation (mad) of the errors in the photometric redshifts obtained with the template 
fitting method. Left panel: original sample of SDSS quasars; right panel: simulated sample. The circles (black in color version) denote 
the medians for each grade group; squares (brown in color version) denote the mad. 
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pute the tapered (or bounded) error estimator defined by: 



l + z 



S z tanh 



N ^ 



5, tanh 



S z l + z 
1 Zp(i) - z s (i) 



(8) 



all 
1 2 



5 Z l + z s (i) 



where 8 Z — 0.02 in our case. For accurate quasar photo-z's 
(z p m z s ) with minimal contamination from outliers, this er- 
ror estimator yields the usual contribution to the rms error, 
while for samples heavily influenced by catastrophic photo- 
z's, this estimator assigns a contribution which asymptotes 
to our threshold S z . 

Second, we compute the purged rms error, summing 
only over the non-catastrophic photo-z's: 



l + z 



1 



JV™ C 

£ 

i=l 



'z p (i) - z s {i)f 
[1 + *.(<)]" 



(9) 



The estimators ([H])-© are therefore complementary: the ta- 
pered error estimator is indicative of the rate of catastrophic 
errors, while the purged rms error is a more faithful repre- 
sentation of the overall accuracy of the method for the bulk 
of the objects. The results for the two estimators of the pho- 
tometric redshift uncertainties are shown in Fig. 1101 for the 
four grade groups. The two estimators are in good agree- 
ment for the groups 172, 33 and <?4, which is again evidence 
that the rate of catastrophic photo-z's is negligible for these 
groups. 

Thus, we conclude that with the template fitting 
method alone it is possible to reach a photo-z accuracy bet- 
ter than \z s — z p \/(l + z s ) ~ 0.0015 for at least ~ 75% of 
quasars, even for a population of faint objects (our simulated 
sample), with a very small rate of catastrophic redshift er- 
rors. In fact, the average accuracy given by the median and 
median deviation errors is already of the order of the in- 
trinsic error in the spect roscopic redshifts due to line shifts 
|Shen et all (|2007l . l2010l )]. This means that, with filters of 
width ~ 100 A (or, equivalently, with low-resolution spec- 
troscopy with R ~ 50) we are saturating the accuracy with 
which redshifts of quasars can be reliably estimated - al- 
though, naturally, with better resolution spectra and larger 
signal-to-noise the rate of catastrophic errors would be even 
smaller. 

It is useful to compare the results of this sec t ion wit h 
those of the COMBO-17 quasar sample [Wolf et all (|2003bl )]. 
That catalog, which employs five broad filters (ugriz) and 12 
narrow-band filters, attains a photo-z accuracy of a z — 0.03 
- the sa me that was also obt ained for the COMBO-17 galaxy 
catalog [Wolf et al.l (1200331 )]. The accuracy that we obtain 
for quasars with the 42 contiguous narrow-band filters is 
also of the same order as that which is obtain ed for red 
and emission-line galaxies [Bemtez et al.l (|2009l )]. Clearly, 
the gains in photo-z accuracy are not linear with the width 
of the filters, and the issue of continuous coverage over the 
entire dynamic range also plays in important role. 

Finally, in order to understand how the photometric 
depth relates to photo-z depth, it is useful to compare the 
photo-z quality indicator for each object to the i-band mag- 
nitude of the simulated sample, i s , as well as the dependence 
of the actual photo-z errors with i„. The magnitude is di- 
rectly related to the SNR through Eq. ([2]). From the left 



panel of Fig. [TT] (which should also be compared to the right 
panel of Fig. [7} we see that the quality indicator declines 
steeply for the faintest objects in the simulated sample. From 
the right panel of Fig. [11] we see that the actual photo-z er- 
rors (which are plotted on an inverted scale) also depend 
on the magnitude, but in this case even for the faintest ob- 
jects a substantial fraction of the quasars still have correctly 
estimated redshifts. This means that our quality indicator 
(which was calibrated for the full sample, independently of 
magnitude) is not very good at capturing the photo-z de- 
pendence for the faintest objects. Clearly, a more accurate 
analysis than the one we have implemented can be achieved 
by including the magnitudes as additional parameters for 
estimating the photo-z's. 



2.4 Photometric redshifts of quasars: Training Set 
Method 

Training methods of redshift estimation are partic- 
ularly well suited when a large and representative 



available IConnollv et all (119951); 


Firth et al.l (l2003h; 


ICsabai et al. 


J2003l); Collister & Lahav 


(l2004l);IOvaizu et all 


(l2008f): iBanerii et al. 1 (20081); iBonfield et all (20ld); 



be a fair sample of the photometric set of galaxies for 
which we want to estimate redshifts, reproducing its color 
and magnitude distributions. Whereas lack of coverage in 
certain regions of parameter space may imply significant 
degradation in photo-z quality, having a representative and 
dense training set can lead to a superior photo-z accuracy 
compared to template fits. 

Empirical methods use the training set objects to 
determine a functional relationship between photomet- 
ric observables (e.g. colors, magnitudes, types, etc.) and 
redshift. Once this function is calibrated, usually re- 
quiring that it reproduces the redshifts of the training 
set as well as possible, it can be straightforwardly ap- 
plied to any photometric sample of interest. This class 
of methods includes machine learn i ng te chniques such 
as nearest neighbors Csabai et al.l d2003f)l, loca l poly - 
nomial fits IConnollv et all (|l995l ); ICsabai et all (120031); 



lOvaizu et alj (|2008|)1. glob al neur al networks iFirth et all 
|2003l ); I Collister fc Lahavl d2004h; lOvaizu et alj (|2008l )]. 



and gaussian processes " [Bonfield et all (|2010l )]." They have 



also b een successful l y app lied to galaxy surveys, e.g. the 
SDSS [5i aizu et all (|200ST)1 . allowing further applications 
in cluster detection iDong et all (| 200STl| and weak lensing 
[Mandelbaum et all (|2008l ); ISheldon et al. I (|2009l )]. 

The training set can also be used to improve template 
fitting, using it either to generate good priors or for empirical 
calibration and/or determination of the templates by, e.g., 
PCA of the spectra. Training sets are usually necessary to 
assess the photo-z quality of a certain survey specification 
and for calibration of the photo-z errors, which ca n then be 
model e d and included in a cosmological analysis |Ma et all 
(|2006t ); lLima fc Hull|2007D ]. In this sense, it is the knowledge 
of the photo-z error parameters - and not the value of the 
errors themselves - that limit the extraction of cosmological 
information from large data sets. 

Here we implement a very simple empirical method, 
mainly to compare it with the template method presented 
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0.007 




Grade group Grade group 

Figure 10. Photo-z errors obtained with the template fitting method for each grade group: (i) circles (blue in color version): rms error 
excluding catastrophic redshift errors, cf. Eq. [9] and (ii) squares (red in color version): rms tapered error including catastrophic redshift 
errors, cf. Eq. \E\ When these two quantities coincide, the fraction of catastrophic photo-z's has become negligible. 




photo— z quality factor -^-ogio\z s -Zp\/(.l+z s ) 

Figure 11. 2D histogram of the simulated sample, showing the magnitude in the i-band versus the photo-z quality indicator (left panel), 
and the magnitude versus the photo-z error on an inverted scale (right panel). These plots should be compared with the right panel of 
Fig. El 



in the previous Section. We use a simple nearest neighbor 
(NN) method: for each photometric quasar, we search the 
training set for its nearest neighbor in magnitude space, and 
then assign that neighbor's spectroscopic redshift as the best 
estimate for the photo-z of the photometric quasar. We de- 
fine distances with an Euclidean metric in multidimensional 
magnitude space, such that the distance dij between objects 
i and j is: 

JV 

dij = ^2(m.i -m]f , (10) 

a=l 

where N = 42 is the number of narrow filters and m" is the 
a th magnitude of the i th object. The nearest neighbor to a 
certain object i is then simply the object j for which dij is 
minimum. 

We computed photo-zs in this way for all 10 4 quasars 
in the catalog. For each quasar, we took all others as the 
training set. In this case, there is no need to divide the ob- 
jects into a training and photometric set, because all that 
matters is the nearest neighbor. 

We can also use knowledge of the distance between the 
nearest neighbor and the second-nearest neighbor to assign 
a quality to the photo-z's obtained with the training set 
method. The idea is that the quality of the photo-z is related 
to how sparse the training set is in the region around any 
given object. The original and simulated samples were then 



divided into four groups of increasing density (i.e., decreas- 
ing sparseness), as we did for the template fitting method. In 
Fig. [12] we show the photo-z's as a function of spectroscopic 
redshifts for the original sample of quasars (left panel), and 
for the sample simulated with J-PAS specifications (right 
panel), for the four quality groups. 

The results for the median and median deviation of \z s — 
z p \/(l + z s ) are shown in Fig. 1131 Although the fraction of 
outliers for groups 2-4 is roughly the same (at the level of 
2-3%), the median and the median deviation of the photo-z 
errors are clearly correlated with the density of the training 
set. Comparing with Fig. [9] we see that the training set has 
a lower accuracy than the template fitting method - both 
the median and the median deviation of the training set 
groups are about twice as large as those of the template 
fitting groups. 

The rms error after removing catastrophic objects with 
S z > 0.02(1 + z) is, for the original sample, a™ c /(l + z) = 
0.035, 0.001, 0.0016 and 0.0037 for the sparseness bins 1-4. 
For the simulated sample the rms errors after eliminating 
the outliers are cr" c /(l + z) = 0.082, 0.0045, 0.0045 and 
0.007 for the sparseness bins 1-4. For the photo-z groups 2, 
3 and 4, the errors as measured by this criterium are about 
2-3 times as large as the ones obtained with the template 
fitting method (see Fig. [10)) . 

We expect these results to improve significantly if we 
employ a denser training set. With the relatively sparse 
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Figure 12. Scatter-plots of spectroscopic redshifts (horizontal axis) versus photometric redshifts (vertical axis) obtained with the 
training set method, for the four groups of decreasing sparseness (1, 2, 3 and 4, in decreasing sparseness). Left panel: original sample; 
right panel: simulated sample. As before, there are 2,500 objects in the first group (first quadrant in the upper right corner, red dots in 
color version); 2,500 objects in the second group (second quadrant and green dots); 2,500 objects in the third group (third quadrant and 
blue dots); and 2,500 objects in the fourth group (fourth quadrant and black dots). 
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Figure 13. Median (med) and median absolute deviation (mad) of the errors in the photometric redshifts for the training set method. 
Left panel: original sample of SDSS quasars; right panel: simulated sample. The circles (black in color version) denote the medians for 
each grade group; squares (brown in color version) denote the mad. 



training set used here, we do not expect complex empiri- 
cal methods to improve the photo-z accuracy. For instance, 
we have tried to use the set of the few nearest neighbors of 
a given object to fit a polynomial relation between magni- 
tudes and redshifts, which we then applied to estimate the 
redshift of the photometric quasar. The results of such pro- 
cedure were similar but slightly worse than simply taking 
the redshift of the nearest neighbor. That happens because 
our quasar sample is not dense enough to allow for stable 
global - and even local - fits. 

With a sufficiently large training set, it has been shown 
that global neural network fits produce photo-z's of sim- 
ilar accuracy to th ose obtained by local polynomial fits 
[Ovaizu et alj (|2008l )]. However these used a few hundred 
thousand training set galaxies spanning a redshift range of 
[0,0.3] whereas here we have f0 4 quasars spanning the red- 
shift range [0,5]. 



2.5 Comparison of the template fitting and 
training set methods 

We have seen that the two methods for extracting the red- 
shift of quasars, given a low-resolution spectrum, yield errors 
of the same order of magnitude. Both the template fitting 
(TF) and the training set (TS) methods also yield empirical 
criteria for selection of potential catastrophic redshift errors 
(the "quality factor" of the photo-z, in the case of the TF 
method, and the distance between nearest neighbors in the 
case of the TS method), which allows one to improve purity 
at the price of reducing completeness. 

A larger sample of objects (the entire SDSS spectro- 
scopic catalog of quasars, for instance, has ~ 10 5 objects, 
instead of the ~ f 4 that we used in this work) would im- 
prove the performance of the TS method significantly, but 
may not necessarily make the performance of the TF method 
much better. A larger sample means a denser training set, 
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which will certainly lead to better matches between nearby 
objects, as well as a better overall accuracy. From the per- 
spective of the TF method, a larger sample only means a 
larger calibration set, and with our sample the performance 
of the method is already being driven not by the calibration, 
but by intrinsic spectral variations in quasars - something 
that the TS method is perhaps better suited to detect. 

We have also applied a hybrid method to improve the 
quality of the photo-z's even further, by combining the power 
of the TF and TS methods in such a way that one serves 
to calibrate the other. The method was implemented for the 
simulated sample of quasars in the following manner. First, 
we eliminate the 10% worst photo-z's from the samples of 
quasars, either by using the quality factor, in the case of 
the TF method, or by using the distance between nearest 
neighbors, in the case of the TS method. This procedure 
alone reduces the median of the errors, Az/(l + z), to 0.0014 
(TF) and 0.0024 (TS), and reduces the fraction of outliers 
to 5% (TF) and 4% (TS). 

The next step is to flag as potential outliers all objects 
which have been rejected by either one of the 10% cuts, and 
to eliminate them from both samples - i.e., objects rejected 
by one method are also culled from the sample that survives 
the cut from the other method. The result is a culled sample 
containing about 83.6% of the initial 10 4 objects. In that 
sample, the fraction of outliers is further reduced to 3.5% 
(TF) and 2.6% (TS). 

The final step is to compare the two photo-z's in the 
culled sets and flag those that differ by more than a certain 
threshold, namely \ztf — «ts|/[1 + 0.5(ztf + zts)] = 0.02. 
After removing the flagged objects we still retain about 80% 
of the original sample (8001 quasars), but the fraction of 
outliers falls dramatically, to 0.6% (47 objects out of 8001). 
The median error for this final sample is 0.0013 (TF) and 
0.0023 (TS), and the median deviation is 0.00084 (TF) and 
0.0014 (TS). 

Hence, the combination of the TF and TS methods 
can yield 80% completeness with 99.4% purity, and quasar 
photo-z errors which are as good as the spectroscopic ones. 
The histogram in Fig. [2] illustrates how this hybrid method 
is able to identify the outliers, and Table [T] shows how the 
performance of the photo-z estimation is enhanced by the 
successive cuts. Although the TS method is slightly better 
than the TF method at identifying the outliers, it is sig- 
nificantly worse in terms of the accuracy of the photo-z's. 
However, the performance of the TS method should improve 
with a larger (and therefore denser) training set. 

As a final note, there are a few important factors that 
we have not considered, which may affect the performance 
of the quasar photo-z's. One of them is the calibration of the 
filters, which, if poorly determined, could introduce fluctua- 
tions of (typically) a few percent in the fluxes. Since J-PAS 
uses a secondary, 0.8 m aperture telescope dedicated to the 
calibration of the filter system, the stated goal of reaching 
3% global homogeneous calibration seems feasible - and, in 
fact, we employed that lower limit for the noise level of our 
simulated quasar sample. An even more important factor is 
the time variability of the intrinsic SEDs of quasars, which 
can be a much larger effect than the fluctuations induced 
by calibration errors. Since a final decision concerning the 
strategy of the J-PAS survey has not yet been reached at 
the time this paper was finished, we decided not to pursue 



a simulation that took variability into account. However, it 
seems likely that each quasar that is observed by J-PAS will 
have several (7 or more) adjacent filters measured during 
an interval of a few (4-10) days, at most, and the full SED 
will be represented by a few (4-8) of these snapshots. In that 
sense, the information in the time domain contained by these 
snapshots would not be simply a nuisance, but may be used 
to aid in the identification of the quasars. 



2.6 Completeness and contamination 

In order to understand how a quasar sample produced from 
an optical narrow-band survey could be contaminated by 
other types of objects (stars, mostly), we have used data 
from SDSS spectroscopic plates in which a random subsam- 
ple of all point sources with i < 19 had their spectra taken 
[Adelman-McCarthv et al] l|2006h ]. We randomly extracted 
10 4 stars from this catalog, and processed their spectra using 
the template fitting method that was outlined in the previ- 
ous subsections for the SDSS simulated quasars. However, 
we do not include any star templates in our fitting proce- 
dure, so the only questions we are asking are: (i) what is 
the redshift which best fits the quasar template spectrum 
to the spectra of each individual star, and (it) what are the 
qualities of those fits (their reduced x 2 )? 

Using the tools which were introduced in Section 2.3 we 
were able to reject the overwhelming majority of stars, just 
on the basis of their poor reduced \ 2 fits to the quasar tem- 
plate, and the degeneracy in their photo-z's as measured 
by the parameter r of Section 2.3 (stars lack the quasar's 
emission- line features, which are the key determinants of the 
photo-z's, and this translates into high values of r). Hence, 
it is clear that, in this sense, stars are quite segregated 
from quasars - and the introduction of stellar templates 
would further improve this separation. As a com parison, the 
COMBO- 17 quasar catalog (Wolf et alj (|2003bl )] doesn't suf- 
fer from significant contamination from stars, even though 
it has a lower spectral resolution than J-PAS, and similar 
depths. Hence, we conclude that the prospects of J-PAS 
achieving high levels of purity and completeness are quite 
good - however, we cannot definitively answer this question 
here, and leave this critical issue to future work. 

Nevertheless, we can determine which redshift ranges 
are most likely to affect the completeness and purity of 
the quasar sample due to contamination from stars. Fig. 
1151 shows that the photo-z's falsely assigned to stars are 
concentrated in a few intervals, corresponding to redshifts 
where the visible region of the quasar spectra present few 
distinguishing features. The concentration of false photo-z's 
in narrow intervals is starker for those stars whose spectra 
can most easily be confused with those of quasars - which, 
for the purposes of this exercise, are stars whose fits to the 
quasar template satisfy both \ 2 < 3 an d r < 0.75 (approxi- 
mately 1% of the total). Some of these problematic redshift 
intervals also contain a large proportion of the catastrophic 
photo-z's for the true quasars (see the right panel of Fig. 
I15|> . These plots indicate that contamination should be a 
greater concern for the redshift intervals 1.30 < z p < 1.31, 
2.2 < z p < 2.22 and 2.65 < z p < 2.7. 
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Figure 14. Histograms of the photo-z errors for the simulated sample of quasars. The left and right panels correspond to the template 
fitting (TF) and training set (TS) methods, respectively. The first quality cut (i.e., the quality factor in the case of the TF method, and 
the distance between nearest neighbors in the case of the TS method) reduces the full sample of 10 4 quasars by 15% (upper bars, red in 
color version). The second cut, obtained by comparing the photo-z's from each method, further reduces the number of quasars to 80% 
of the full sample (i.e., 8001 objects). The rate of outliers in this final sample is approximately 0.6% - see Table[TJ 



Table 1. Completeness (fraction of objects that remain after applying the cuts), purity (fraction of objects after culling the outliers) 
and accuracy of the photo-z's for the simulated sample of quasars. The first step eliminates the 10% worst-quality photo-z's in both 
techniques, producing the samples TFgo and TSgo. The second step keeps only those objects which are present both in TF90 and in 
TSgo, producing the samples TF C and TS C . The last step is to compare the photo-z's that were obtained with the different techniques, 
and to flag those that differ by more than the threshold Az/(1 + z) > 0.02 as potential outliers. 



Method 


Completeness (%) 


Purity (%) 


median [ Az/(1 + z) ] 


TF 90 = TF - TF10 


90 


95 


0.0014 


TS90 = TS - TS10 


90 


96 


0.0024 


TF C = TF90 - TS10 


85 


96.5 


0.0014 


TS C = TS90 - TF10 


86 


97.4 


0.0023 


TF C v. TS C 


80 


99.4 


0.0013 


TS C v. TF C 


80 


99.4 


0.0023 




Figure 15. Left panel: photo-z's assigned to stars by the quasar template fitting code, in 100 bins of redshift. The 1% stars with the 
highest potential to be confused with quasars (i.e., those whose spectra satisfy both \ 2 < 3.0 and r < 0.75) are shown as dark bars. 
Right panel: fraction of photo-z outliers [i.e., quasars whose photo-z's differ from the correct redshifts by more than 0.02(l+z)] for the 
simulated quasar catalog, in 100 bins of redshift. 
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3 QUASARS AS COSMOLOGICAL PROBES 

iRichards et all 



The SPSS sampl e of qu asars 



Vanden Berk et aU 
Yip et all (1200411; 



(2001 



Schneider et al l (120031); 
ISchneider et al. I J2007I); IShen et al l 



2001); 



20071); iRoss et al.l (|2009l ); ISchneider et al. I (|2010r i; 



IShen et all (|2010T )] has enabled a reliable 



measu r ement 
hards et all j2005l . 



of th e quasar lumino sit y fun cti on ^_ 

I2006T ); iHopkins et aU feoOTf l; ICroom et alj (|2009al lbl)]. 

which, in terms of the g -band absolute magnitude is given 
by the fit [Croom et all (|2009bl l]: 



<t>{M G ,z) = 



4>o 



10 0.4 (1+a) [-M G -M* (*)] + 10 0.4 (1+/3) [M G -M* (z)] ' 

(11) 

where 4>o = 1.57 x 10~ 6 Mpc" 3 , a = -3.33, j3 = -1.41 and 
the break magnitude expressed in terms of Mq is given by: 

Mg(z) = -22.2 - 2.5 (1.44 z - 0.32 z 2 ) . (12) 

Notice that the quasar luminosity function and the break 
magnitude were obtained with a sample of quasars only 
up to z ~ 2.5, and it is far from clear that these fits can 
be extrapolat e d to h igher redshifts and lower luminosities 
(Croom et ail (|2009bl )]. 

To obtain the number density of quasars as a function 
of some limiting (absolute) magnitude M G , the luminosity 
function above must be integrated up to that magnitude. 
In Fig. [TFJ] we plot the quasar volumetric density both in 
terms of the limiting apparent magnitude in the g band 
for flux-limited surveys, n(< gu m ) = J® llm dg <j>(g) (solid 
lines, gn m = 24, 23, 21 and 19, from top to bottom), and 
also in terms of the absolute magnitudes n(< Mo,iim) = 
I^T Um dM G 4>{M G ) (dashed lines, M G , lim = -20, -22, -24 
and -26 from top to bottom.) Since contamination from the 
host galaxy may hinder our ability to identify low-luminosity 
quasars through color selection (this can be especially prob- 
lematic at low redshifts) , we chose to apply a cut in absolute 
magnitude in the luminosity function, in addition to the ap- 
parent magnitude cut. 

As a concrete example, we will discuss a flux-limited 
survey up to an apparent magnitude of g < 23, and in- 
clude only those objects which are more luminous than 
Mq < —22, since quasars fainter than this often have their 
light dominated by the host galaxy. The resulting comoving 
number density is shown as the dashed line and hashed re- 
gion in Fig. 1161 which peaks at z ~ 1.6 with n max ~ 10~ 5 
Mpc -3 (or ~ 3.10~ 4 h 3 Mpc" 3 .) If the limiting apparent 
magnitude is g < 24, the number density can be as large 
as 10 -4 h 3 Mpc~ 3 at z ~ 2. As we will see below, the rel- 
atively small density of quasars when compared to galaxies 
(which can easily reach n > 10 -3 Mpc -3 ) is compensated 
by the facts that quasars are highly biased tracers of large- 
scale structure, and that the volume that they span is larger 
than that which can be achieved with ga l axies - for a sim- 
ilar an alysis, see also IWang et al.l (|2009l ); TSawangwit et al.l 
|201ll ). 

It is also useful to compute the total number of quasars 
that a large-area (1/5 of the sky), flux- limited survey could 
produce - assuming the quasar selection is perfect. In Fig. 1171 
we show that an 8.4 x 10 3 deg 2 survey up to g < 23 (g < 24) 
could yield 2.0 x 10 6 (3.0 x 10 6 ) objects, up to z = 5. 




Figure 16. The volumetric density of quasars for different lim- 
iting g-band apparent magnitudes (solid lines) and different ab- 
solute magnitudes (dashed lines), as a function of redshift, com- 
puted according to the luminosity function of Croom et al. 2009. 
The solid lines, from top to bottom, correspond to limiting mag- 
nitudes of g < 24 (green in color version), 23 (yellow in color ver- 
sion), 21 (red in color version) and 19 (blue in color version); the 
short-dashed lines, from top to bottom, correspond to absolute 
luminosity cut-offs of Mq < -20, -22, -24 and -26 respectively. 



Log, N 




Figure 17. Total numbers of quasars in Az = 0.5 bins for an 
8.4 X 10 3 deg 2 survey, assuming a 5cr point-source magnitude limit 
of g = 23 (left bars, red in color version) and 24 (right bars, blue in 
color version.) The numbers are identical for z < 1.5 because our 
selection criteria culls the quasars fainter than Mq = —22, which 
means that for z < 1.5 the catalog is equivalent to a volume- 
limited and absolute magnitude-limited survey. 



3.1 Large-scale structure with quasars 

Quasars, like any other type of extragalactic sources, are bi- 
ased tracers of the underlying mass distribution: P q (k,z) = 
bq(z)P(k, z), where P(k,z) is the matter power spectrum, 
P q (k,z) is quasar power spectrum (the Fourier transform 
of the quasar two-point correlation function), and b q is the 
quasar bias . The quasar bias is a stee p function of redshift 
|Shen et alj (|2007l ); IRoss et alj (|2009l )]. and it may depend 
weakly on the intr insic (absolute) luminosities of the quasars 
|Lidz et al] l|2006l )]. but it is thought to be independent of 
scale (k) - at least on large scales. 

The connection between theory and observations is fur- 
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ther complicated by the fact that both the observed two- 
point correlation function and the power spectrum inherit 
an anisotr o pic co mponent due to redshift-space distortions 
[HamiltorJ jl997ft ]. In this work we will only consider the 
monopole of the power spectrum, P(k) = j^ 1 dfj,P(k, fj,), 
where [i is the cosine of the angle between the tangential 
and the radial modes. We will address the full redshift-space 
dataset from our putative quasar survey, as well as the re- 
sulting constraints thereof, in future work. 

To first approximation the statistical uncertainty in the 
p ower spectrum can be estimated using the formula derived 
in lFeldman et all (| 19941 ) for three-dimensional surveys: 



AP(k,z) 
P(k,z) 



N m (k,z) 



1 + 



n{z)b 2 (z)P{k,z) 



(13) 



where n is the average number density of the objects used to 
trace large-scale structure, and b is the bias of that tracer. 
The number of modes (the statistically independent degrees 
of freedom) in a given bin in fc-space is given by N m = 
A-kV{z,z + Az)k 2 Ak/(2n) 3 , where Az and Ak denote the 
thickness of the redshift bins and of the wavenumber bins, 
respectively. The first term inside the brackets in Eq. [13] 
corresponds to sample variance, and the second corresponds 
to shot noise (assuming the variance of the shot noise term 
is that of a Poisson distribution of the counts.) Since the 
power spectrum peaks at P < 10 4 ' 5 h~ z Mpc 3 , a quasar 
survey with n < 10 -5 h 3 Mpc -3 would be almost always 
limited by shot noise. 

For the purposes of this exercise we have used 28 bins in 
Fourier space, equally spaced in log(fc), and spanning the in- 
terval between 0.007 h Mpc -1 < k < 1.4 h Mpc -1 . Our ref- 
erence ma tter power spectrum Pp(k, z) is a modified BB KS 
spectrum iBardeen et al . Jl986l) l [see also iPeacockl (| 19991 ) or 
lAmendola fc Tsuiikawal (|2010h ]. The transfer function of the 
BBKS fit does not contain the BAO modulations, so we have 
modeled those fea t ures in the spe ctrum by mea n s of t he fit 
|Seo fc Eisensteinl ([20071 ); see also lBem'tez et alj <|2009h ]: 



P(k, z) = P (k, z) [l + kA sin(fcr S Ao)e" fc2fl2 ] 



(14) 



where t B ao = 146.8 Mpc = 105.7 h' 1 Mpc is the length 
scale of the BAOs that can be inferred from WMAP 
[Hinshaw et~ai1 (|2009h ]. A = 0.017 r bao is the amplitude 
of the acoustic oscillations, and R = 10 /i -1 Mpc denotes 
the Silk damping scale. 

Eq. (|13p is an approximation which is appropriate for 
spectroscopic redshift surveys, although this is not the type 
of survey that we are considering. Nevertheless, we have 
showed in the previous Section that, with narrow-band fil- 
ters, the error in the photo-z's of quasars can be lower than 
Sz ~ 0.002 (l+z), which is excellent but not quite equivalent 
to a spectroscopic redshift. Redshift errors smear structures 
on small scales along the line-of-sight, and can be factored 
into the estimation of the power sp e ctrum through an em- 
pirical damping term (Angulo et all ([20081 )]: 



exp 



-A' I 



H 2 (z) 



(15) 



where ku = k/j, denote the modes along the line-of-sight. 
In our ACDM model, photometric redshift errors suppress 
modes which are smaller than about fc^" 1 ~ 5 Z x 10 3 /i -1 Mpc 
at z = 2 (or k^ 1 ~ 8~ x 5.10 2 /i -1 Mpc at z = 4). Hence, a 



K 2 - 




Lo£ ]0 k [ft Mpc 



Figure 18. The contours denote the statistical errors in the 
power spectrum, log 10 AP(k, z)/P(k, z), for an 8.4 X 10 3 deg 2 
quasar survey, flux-limited down to g < 23, and limited to objects 
brighter than Mq = —22. From inside to outside, the contours 



correspond to AP/P = 10 -15 , 10 -1 , 10 -05 and 10°. The uncer- 
tainties were computed using Eq. 1131 For this plot we binned the 
redshift slices in intervals of Az = 0.1, and the wavenumbers were 
divided into 28 equally spaced bins in log(fc), spanning the inter- 
val between k = 0.007 ft- Mpc -1 and k = 1.4ft Mpc -1 . Photo-z 
errors and uncertainties in the bias of quasars are not included in 
our error budget. 



quasar photo-z error of the order of 0.002(1+2) only starts to 
affect the power spectrum at scales ku > 0.2h Mpc -1 at z=2, 
and ku > OAh Mpc -1 at z=4. This is smaller than either the 
Silk damping scale or the scales at which nonlinear effects 
kick in (see the discussion below), so we expect that photo-z 
errors will be a subdominant nuisance in the estimation of 
the power spectrum and derived parameter constraints. 

Another important point concerning Eq. (|13p is that it 
applies to the power spectrum as estimated by some biased 
tracer, but it does not automatically include the uncertainty 
in the bias or the selection fun ction, or other system atic ef- 



fects such as bias stochasticity Dekel fc Laha- 



we employ the fit found by [Ross et al.1 ([200 



|l99Sh ]. Here 
for quasars 

with z < 2.2, which is given by b q (z) = 0.53 + 0.29 (1 + z) 2 . 
Although this bias has large uncertainties, especially at high 
redshifts, we will implicitly assume that b q (z) is a linear, de- 
terministic bias that has been fixed at each redshift by this 
fit. 

In Fig. [18] we plot the contours corresponding to equal 
uncertainties in the power spectrum as a function of the scale 
[log 10 k (h Mpc -1 ), horizontal axis)] and redshift z (vertical 
axis), according to Eqs. (Ill[) - (|14[ ). and assuming that the J- 
PAS survey covers 8.4 x 10 3 deg 2 to a 5<r limiting magnitude 
of g < 23. There are three main effects that determine the 
shape of the contours in Fig. 1181 first, at fixed k and low 
redshifts, both the volume of the survey as well as the num- 
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ber density of objects (which is determined by the absolute 
luminosity cut) are small, while at high redshifts the number 
density falls rapidly due to the apparent magnitude cut. Sec- 
ond, for a fixed z the uncertainty as a function of k decreases 
up to scales k ~ 0.02 ft Mpc -1 , where P(k) peaks, and as 
it starts to fall, it increases the Poisson noise term in Eq. 
(|13[) . Finally, the redshift evolution of the power spectrum 



[P(k,z) ~ D (2), where D(z) is the linear growth function] 
also increases the shot noise at higher redshifts - although 
this effect is partly mitigated by the redshift evolution of the 
quasar bias. Quasars achieve their best performance in esti- 
mating the power spectrum at z ~ 1 - 3. This is because in 
that range the quasar bias increases faster than the number 
density falls as a function of redshift. 

A closely related way of assessing the potential of a 
survey to measure the power spectrum is through the so- 
called effective volume: 



V eB (k) = J d 3 



nb 2 P(k) 



l + nb 2 P{k) 



where x is comoving distance, and both the average num- 
ber density n and the bias b are presumably only functions 
of x (or, equivalently, of redshift). The effective volume is 
simply (twice) the Fisher matrix element that corresponds 
to the optimal (bias-weight e d) estimator of t he po wer spec- 
trum (Feldman et~ai1 (jl994r ); lTegmark et all (| 19981 )]. In Fig. 
[19] we show the effective volume for our quasar survey (full 
line). For comparison, we have also plotted the effective vol- 
ume of a hypothetical quasar survey similar to BOSS or Big- 
BOSS, that would target ~ 5.10 5 objects over the same area 
and with the same redshift distribution as the J-PAS quasar 
survey (long-dashed line). Also plotted in Fig. Q20 are the 
effective volumes of two surveys of LRG s assuming the lu- 
minosity function of iBrown et alj |2007l ). either in the case 
of a shallow survey flux-limited to g < 21.5 ("SDSS-like", 
short-dashed line), or for a deep survey limited to i < 23 
("J-PAS-like", dashed line.) 

In Fig. [20] we plot the power spectrum divided by the 
BBKS power spectrum Po(k), in order to highlight the BAO 
features. The error bars, from leftmost to rightmost (black 
in color version to orange in color version), corresponds to 
measurements of the power spectrum in redshit bins of A 2 = 
0.5 centered in z = 0.5, z — 1.0, z = 1.5, 2 = 2.0, 2 = 2.5 and 
2 = 3.0, respectively. The power spectrum at low redshifts 
is poorly constrained, but this improves at high redshifts 

(2-1-3). 

Figs. 1 1841201 demonstrate that quasars are not only vi- 
able tracers of large-scale structure, but they can also detect 
the BAO features at high redshifts. An interesting advan- 
tage of a high-redshift measurement of BAOs is the milder 
influence of redshift distortions and nonlinear effects. In lin- 
ear perturbation theory, the redshift-space and the real- 
space spectra are related by P j 3 Vpj r ^ ~ 1 + |/3 q + 
(Kaiserl (|l987T ): lHamiltorJ ljl997f )]. where /3 q ~ Q.° m 5b /b q in a 
flat ACDM Universe. Redshift distortions in the nonlinear 
regime are more difficult to tak e into account, but they als o 
scale roughly with j3„ - see, e.g..|jain fc Bertschingerl (Il994f) 



iMeiksin et alj (120011 ) ; IScoccimarro et all (12010 



20ld ) ; ISmith et alj (|2006l ); lAnguIo et"ail (|2008h ; 



Seo et al 



Seo et al 



2008). Since quasars become more highly biased at high 
redshifts, both linear and nonlinear redshift-space distor- 
tions are suppressed relative to the local Universe. 




-3.0 



LogjQ k [h Mpc" 



Figure 19. Effective volume of a flux-limited quasar catalog 
(g < 23 and z < 4) over 8.4 X 10 3 deg 2 . We also show the effective 
volume of a putative spectroscopic survey of quasars with 4.10 s 
objects, where we assumed the same area and redshift distribu- 
tion as was used for the J-PAS catalog ("BOSS-like", long-dashed 
line.) For comparison, we also show two hypothetical catalogs of 
luminous red galaxies (LRGs) over the same area, one limited to 
g < 21.5 ("SDSS-like", short-dashed line, blue in color version) 
and the other limited to i < 23 (long-dashed line, red in color 
version.) For the LRG estimates, we used the luminosity function 
of Brown et al. (2007) and assumed a constant bias 6lrg =1-5. 
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Figure 20. Baryon acoustic oscillations in position space. The 
oscillations are highlighted by dividing the full spectrum by a 
reference BBKS spectrum Po(k) without the baryon acoustic fea- 
tures. From left to right, the error bars correspond to the uncer- 
tainties at z = 0.5 (black curve and grey error bars), z = 1.0 (red), 
z = 1.5 (green), z = 2.0 (blue), z = 2.5 (purple), and z = 3.0 (or- 
ange). In this plot we employed redshift bins of Az = 0.5. The 
errors of the z = 0.5 bin are much larger than those of other bins 
because: i) the volume of the z = 0.5 bin is much smaller than 
that of other bins, which makes cosmic variance worse; and ii) the 
quasar luminosity function is more dominated by faint objects at 
low redshifts (see Fig. I16jl . and since we have culled those objects 
with our absolute luminosity cut, Mq < —22, the volumetric 
density drops by a large factor, thus increasing shot noise. 
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position-space power spectrum is modeled by: 





0.6 




0.5 ; 






® 

" — ' 


0.4 : 








0.3 ; 






% 


0.2 - 




0.1 1 


< 






0.0 




-0,1 ; 




-y(s,ni) 



(k,z) 



Log 10 k [ h Mpc l ] 

Figure 21. Scaling of the redshift distortions (outer, lighter con- 
tours and green lines in color version) and of the effects of non- 
linear structure formation (inner, darker contours and red lines 
in color version), for z = 1, z = 2 and z = 3 from top to bottom, 
respectively. The uncertainties caused by redshift distortions and 
nonlinear effects, A s n ;P/Po, are indicated by the hashed regions. 
For visual clarity, we shifted the distortions at z = 1 by +0.1, and 
the distortions at z = 3 by —0.1. We use the empirical calibration 
and errors of Angulo et al. 2008 for the redshift and nonlinear dis- 
tortions. For the quasar bias and its uncertainties we employ the 
fit of Ross et al. (2009). 



The effect of random motions can be taken into ac- 
counted by multiplying the redshift-space spectrum factor 
of 1/[1 + k 2 a s (z) 2 ], where a s (z) is a smoothing scale re- 
lated to the one-dimensional pairwise velocity dispersion, 
and is usually calibrated by numerical simulations. Nonlin- 
ear growth of structure and bulk flows (which tend to smear 
out the BAO sign a ture) also decrea se at higher redshift s 
(Smith et all [j2006T ): ISeo et~ai] l|20Qg| )]. I Angulo ' et all HH) 
found a useful parametrization of this effect in terms of a 
Fourier-space smoothing kernel W(k, k n i) = exp [-k 2 /2kl t ], 
where k n i(z) is a non-linear scale determined by numerical 
simulations. 

In Fig. [2l]we plot both the redshift distortions in linear 
theory, and the nonlinear effects on the power spectrum. For 
th e redshift d i storti ons we employ the quasar bias obtained 
in lRoss et all (|2009T l: 

b q = (0.53 ± 0.19) + (0.289 ± 0.035) (1 + z) 2 , 

which we assume holds up to z — 3 (even though the un- 
certainties are very large at such high redshifts.) For the 
smoothing paramet er we have extrapolated the data from 
I Angulo et all (|2008T ). and found a s ~ (4 - 0.96z)/i _1 Mpc 
(this approximation is good up to z ~ 3.) 

Finally, nonlinear structure formation eff ects are taken 
into a ccount by the nonlinear scale given in lAngulo et al.l 
(|2008l ) (which are appropriate for halos heavier than M > 
5 x 1O 13 M ): 

Ka{z) = (0.096 ± 0.0074) + (0.036 ± 0.0094)2 , 

in units of h Mpc -1 . 

With these assumptions, the ratio between the non- 
linear power spectrum in redshift space and the linear, 



= 1 + 



1 + k 2 a 2 



- 1 



Fig. [21] illustrates that the distortions become smaller at 
higher redshifts, and that the uncertainties associated with 
them are also being suppressed. 

In conclusion, we have seen that a large-area catalog of 
quasars, down to depths of approximately g < 23, can yield a 
precision measurement of the power spectrum and of BAOs 
at moderate and high redshifts. The fact that quasars can 
measure large-scale structure even better than LRGs around 
the peak of the power spectrum, despite their much smaller 
volumetric density, can be understood as follows. First, the 
volume spanned by quasars is larger, since they are much 
more luminous and can be seen to higher redshifts than 
galaxies. This makes both sample variance and shot noise 
smaller by a factor of the square root of the volume, ac- 
cording to Eq. (|13[1 . Second, although the number density of 
quasars is at least one order of magnitude smaller than that 
of LRGs, the bias of quasars increases rapidly with redshift, 
and becomes higher than that of LRGs at z ~ 1. Since the 
volumetric factor which determines shot noise is the prod- 
uct of the number density and the square of the bias, (nb 2 ), 
a highly biased tracer such as quasars can afford to have 
a relatively small number density. At or near the peak of 
the power spectrum, the accuracy of the power spectrum of 
quasars is almost limited by sample variance; slightly away 
from the peak, shot noise becomes increasingly relevant, but 
the vast volume occupied by a catalog of quasars means that 
they are still superior compared to red galaxies. It is only 
on very small scales, where the amplitude of the power spec- 
trum is very small, that galaxies become superior to quasars 
by virtue of their much higher number densities - but then 
again, this only works at the relatively low redshifts where 
galaxies can be efficiently observed. 

How, then, could such a catalog of quasars be con- 
structed? One possibility is multi-object spectroscopy. While 
target selection of quasars from broad-band photometry can 
be quite efficient in certain redshift ranges [such as z<2.2 for 
the SDSS filter set [Richards et~afl (|200lh ]. there are ranges 
of redshifts where the broad-band optical colors of quasars 
and the much more numerous stars are indistinguishable, 
especially in the presence of photometric errors. There is 
the additional problem of contamination from galaxies, but 
this should be a subdominant effect compared to stars (we 
leave this issue for future work). The comoving space density 
of quasars peaks between z=2.5 and 3, just the redshi ft at 
which the color locus of quasars crosses the stellar locus |Fanl 
(| 1999h ] . and selecting quasars in th is redshift range tend s 
to be quite ineffi cient and difficult (Richards et all ([20081 ); 
iRoss et al l (|201ll )[. 

A more concrete possibility is a narrow-band photomet- 
ric survey, such as J-PAS, which will take low-resolution 
spectra of all objects (including quasars) in the surveyed 
area. However, there are two problems with this technique: 
first, unless the photometric redshifts of the quasars are very 
accurate, the relative errors in their radial positions could 
be so large as to destroy their potential to map large-scale 
structure. This is even more critical if we want to measure 
the signature of BAOs in the angular and radial directions. 
In the previous Section we showed that it is possible to ob- 
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tain very small photo-z errors, so this should not be the 
main concern. The main problem will be the classification of 
point sources as either quasars or stars, especially for low- 
luminosity objects whose fluxes are noisy. Although stars 
and quasars can be distinguished by criteria such as the \ 2 
of their fits to templates, as well as the level of degener- 
acy of the PDF's of their photo-z's, stars vastly outnumber 
quasars, and hence stringent criteria must be used in order 
to preserve the purity of the quasar sample. This may com- 
promise the completeness (and therefore the final number 
density) of the quasar catalog, which would then lead to 
large levels of shot noise. 

Hence, the key to realizing the potential of quasars to 
measure large-scale structure in a narrow-band photometric 
survey hinges on whether or not we can type a sufficiently 
high proportion of quasars, and obtain accurate photometric 
redshifts for the majority of objects in that catalog. In the 
previous section we showed that this may be possible with 
an instrument such as J-PAS. However, our results can be 
easily generalized to other surveys such as Alhambra (which 
goes deeper than J-PAS, but has broader filters) and HET- 
DEX (which subtends a smaller area and has a similar depth 
compared with J-PAS, but has much better spectral resolu- 
tion). 



4 CONCLUSIONS 

We have argued that quasars are viable tracers of large-scale 
structure in the Universe. A wide and deep survey of these 
objects will be a zero-cost consequence of several ongoing 
or planned galaxy surveys that use either narrow-band filter 
systems or integral field low-resolution spectroscopy. 

Our estimates indicate that a dataset containing mil- 
lions of objects will be a sub-product of these spectrophoto- 
metric surveys, and that they can lead not only to measure- 
ments of the distribution of matter in the Universe up to 
very high redshifts (z < 4), but also to an improved under- 
standing of these objects, how they evolved, what are their 
clustering properties and bias, as well as their relationship 
and co-evolution with the host galaxies. Such a large dataset, 
spread over such vast volumes, will also allow a range of ap- 
plications that break these objects into sub-groups (of ab- 
solute magnitude, types of host galaxies, etc.) 

We have also shown that with a narrow-band set of 
filters (of width ~ 100 A in the optical) it is possible to 
obtain near-spectroscopic photometric redshifts for quasars: 
cr z ~ 0.001(1 + z) with the template fitting method, and at 
least a z ~ 0.002(1 + z) with the training set method. This 
means an unprecedented resolution along the direction of 
the line-of-sight that extends up to vast distances, and is 
a further reason for using quasars as tracers of large-scale 
structure. 



ACKNOWLEDGEMENTS 

R. A. would like to thank N. Bemtez for stimulating dis- 
cussions at several stages of this work. Many thanks also 
to G. Bernstein, A. Bongiovanni, I. Gonzalez-Serrano, B. 
Jain, A. Lidz, G. Richards, M. Sako, M. Sanchez and R. 
Sheth for several enlightening discussions. R. A. would also 



like to thank the Department of Astrophysical Sciences at 
Princeton University, as well as the Department of Physics 
and Astronomy at the University of Pennsylvannia, for their 
hospitality during the period when this work was done. MAS 
acknowledges the support of NSF grant AST-0707266. This 
work was also partially supported by FAPESP and CNPq 
of Brazil. 



References 



Lahav O.. Rashkov V.. 2008, arXiv 



Lahav O. 



Abbott T. et al., 2005, |astro-ph/0510346 
Abdalla F., Banerji M 

e-print 0812.3831 
Abdalla F., Amara A., Capak P., Cypriano E 

Rhodes J., 2008, MNRAS , 387, 969 
Abdalla F., Mateus A., Santos W., Sodre L., Ferreras I 

Lahav O., 2008, MNRAS , 387, 945 
Abell P. et al., 2009, 0912.0201 
Adelman-McCarthy J. K. et al. 
Adelman-McCarthy J. K. et al 

Catalog, 2282, 
Adelman-McCarthy J. K. et al. 
Amendola L., Tsujikawa S 



, 2006, 
2008, 



ApJS, 162, 38 
VizieR Online Data 



. 2008, ApJS, 175, 297 
2010, Dark Energy: Theory and 
Observations. Cambridge University Press 
Angulo R., Baugh C. M., Frenk C. S., Lacey C. G., 2007, 

MNRAS , 383, 755 
Baldwin J. A., 1977, ApJ, 214, 679 

Banerji, M. and Abdalla, F. B. and Lahav, O. and Lin, H, 

2008, MNRAS , 386, 1219 
Bardeen J. M., Bond J. R., Kaiser N., Szalay A. S., 1986, 

The ApJ, 304, 15 
Baum W. A., 1962, in IAUS Vol. 15, Photoelectric magni- 
tudes and Red-Shifts, p. 390 
Bemtez N., 2000, ApJ, 536, 571 
Bemtez N. et al., 2009, ApJ, 691, 241 
Blake, C, Glazebrook, K, 2003, ApJ, 594, 665 
Blake C, Bridle S., 2005, MNRAS , 363, 1329 
Blake C, Collister A., Bridle S., Lahav O., 2007, MNRAS, 
374, 1527 

Bolzonella M., Miralles J., Pello R., 2000, A&A, 363, 476 
Bond, J. R., Efstathiou, G., 1984, ApJ, 285, L45 
Bonfield D. G., Sun Y., Davey N., Jarvis M. J., Abdalla 

F. B., Banerji M., Adams R. G., 2010, MNRAS , 405, 987 
Brown M. J. I., Dey A., Jannuzi B. T., Brand K., Benson 

A. J., Brodwin M., Croton D. J., Eisenhardt P. R., 2007, 

ApJ, 654, 858 
Bruzual G., Chariot S., 2003, MNRAS , 344, 1000 
Budavari T., 2009, ApJ, 695, 747 
Coil A. L. et al., 2010, ArXiv e-prints 1011.4307 
Cole S. et al., 2005, MNRAS , 362, 505 
Collister, A. A. and Lahav, O, 2004, PASP, 116, 345 
Connolly A. J., Csabai I., Szalay A. S., Koo D. C, Kron 

R. G., Munn J. A., 1995, AJ, 110, 2655 
Cool R. J., 2008, PhD thesis, The University of Arizona 
Croft R. A. C, Weinberg D. H., Katz N., Hernquist L., 

1998, ApJ, 495, 44 
Croom S. M., Smith R. J., Boyle B. J., Shanks T., Miller 

L., Outram P. J., Loaring N. S., 2004, MNRAS , 349, 1397 
Croom S. M., Boyle B. J., Shanks T., Smith R. J., Miller 

L., Outram P. J., Loaring N. S., Hoyle F., da Angela J., 

2005, MNRAS . 356, 415 



© 0000 RAS, MNRAS 000, 000-000 



Measuring large-scale structure with quasars 19 



Croom S. M. et al, 2009, MNRAS , 392, 19 

Croom S. M. et al., 2009, MNRAS , 399, 1755 

Csabai I. et al, 2003, AJ, 125, 580 

da Angela J. et al, 2005, MNRAS , 360, 1040 

Dekel A., Lahav O., 1999, ApJ, 520, 24 

Dong F., Pierpaoli E., Gunn J. E., Wechsler R. H., 2008, 

ApJ, 676, 868 
Eisenstein D. J., Hu W., 1999, ApJ, 511, 5 
Eisenstein D. J., 2003, ApJ, 586, 718 
Eisenstein D. J. et al., 2005, ApJ, 633, 560 
Fan X., 1999, AJ, 117, 2528 

Feldman H. A., Kaiser N., Peacock J. A., 1994, ApJ, 426, 
23 

Firth A. E., Lahav O., Somerville R. S., 2003, MNRAS , 
339, 1195 

Hamilton A. J. S., 1997, in "The Evolving Universe", ed. D. 

Hamilton, Kluwer Academic, p. 185 (1998), arXiv e-print 

astro- ph/9708102| 
Hildebrandt, H. et al, 2010, A&A, 523, A31 
Hinshaw G., et al., 2009, ApJS, 180, 225 
Holtzman J., 1989, ApJS, 71, 1 

Hopkins P. F., Richards G. T., Hernquist L., 2007, ApJ, 
654, 731 

Hu W. & Sugiyama N., 1996, ApJ, 471, 542 
Jain B., Bertschinger E., 1994, ApJ, 431, 495 
Kaiser N., 1987, MNRAS , 227, 1 

Kormendy J., Richstone D., 1995, Ann. Rev. Astron. As- 
troph., 33, 581 

Lidz A., Hopkins P. F., Cox T. J., Hernquist L., Robertson 

B., 2006, ApJ, 641, 41 
Lima, M., Hu, W., Phys. Rev. D76, 123013 
Lynden-Bell D., 1969, Nature, 223, 690 
Ma Z., Hu W., Huterer D., 2006, ApJ, 636, 21 
Mandelbaum R. et al., 2008, MNRAS , 386, 781 
Martini P., Weinberg D. H., 2001, ApJ, 547, 12 
Meiksin A., White M., Peacock J. A., 1999, MNRAS , 304, 

851 

Moles M. et al., 2008, AJ, 136, 1325 

Moles M. et al., 2010, PASP, 122, 363 

Oyaizu H., Lima M., Cunha C. E., Lin H., Frieman J., 

Sheldon E. S., 2008, ApJ, 674, 768 
Padmanabhan N. et al., 2005, MNRAS , 359, 237 
Padmanabhan N. et al., 2007, MNRAS , 378, 852 
Padmanabhan N., White M., Norberg P., Porciani C, 2008, 

MNRAS , 397, 1862 
Peacock J. A., 1999, Cosmological Physics. New York: 

Cambridge University Press 
Peebles P. J. E., Yu J. T., 1970, ApJ, 162, 815 
Percival W. J. et al, 2007, MNRAS , 381, 1053 
Porciani C, Magliocchetti M., Norberg P., 2004, MNRAS, 

355, 1010 

Richards G. T. et al., 2005, MNRAS , 360, 839 

Richards G. T. et al., 2002, AJ, 123, 2945 

Richards G. T. et al., 2008, ApJS, 180, 67 

Richards G. T. et al., 2006, AJ, 131, 2766 

Richards G. T. et al., 2001, AJ, 122, 1151 

Richstone D. et al., 1998, Nature, 395, 14 

Ross N. P. et al., 2011, ArXiv e-prints 1105.0606 

Ross N. P. et al., 2009, ApJ, 697, 1634 

Salpeter E. E., 1964, ApJ, 140, 796 

Salvato M. et al., 2009, ApJ, 690, 1250 

Sawangwit U., Shanks T., Croom S. M., Drinkwater M. J., 



Fine S., Parkinson D., Ross N. P., 2011, ArXiv e-prints 
1108.1198 

Schlegel D. J., Finkbeiner D. P., Davis M., 1998, ApJ, 500, 
525 

Schneider D. P. et al., 2003, AJ, 126, 2579 
Schneider D. P. et al., 2003, AJ, 134, 102 
Schneider D. P. et al., 2010, AJ, 139, 23 
Scoccimarro R., Zaldarriaga M., Hui L., 1999, ApJ, 527, 1 
Scoville N. et al., 2007, ApJS, 172, 1 
Seljak U. et al., 2005, Phys. Rev. D71, 103515 
Seo H. J., Eisenstein D. J., 2003, ApJ, 598, 720 
Seo H.-J., Eisenstein D. J., 2007, ApJ, 665, 14 
Seo H. J., Siegel E. R., Eisenstein D. J., White M., 2008, 
ApJ, 686, 13 

Seo H. J., Eckel J., Eisenstein D. J., Mehta K., Metchnik 
M., Padmanabhan N. Pinto P., Takahashi R., White M., 
Xu X., 2010, ApJ, 720, 1650 

Sheldon, E. S. et al., 2009, ApJ, 703, 2217 

Shen Y. et al., 2010, ApJS, 194, 45 

Shen Y. et al., 2007, AJ, 133, 2222 

Smith R. E., Scoccimarro R., Sheth R. K., 2006, Phys. Rev. 
D75, 063512 

Sunyaev R. A., Zeldovich Y. B., 1970, Astroph. Space Sc., 
7, 20 

Tegmark M. et al., 2006, Phys. Rev. D74, 123507 
Tegmark M., Hamilton A. J. S., Strauss M. A., Vogeley 

M. S., Szalay A. S., 1998, ApJ, 499, 555 
Vanden Berk D. E. et al., 2001, AJ, 122, 549 
Wang X. et al., 2009, MNRAS , 394, 1775 
Wolf C. et al., 2003, A&A, 401, 73 
Wolf C. et al., 2003, A&A, 408, 499 
Yahata K. et al., 2005, PASJ, 57, 529 
Yip C. W. et al., 2004, AJ, 128, 2603 
York D. G., et al., 2000, AJ, 120, 1579 
Zel'Dovich Y. B., Novikov I. D., 1965, Soviet Physics Dok- 

lady, 9, 834 



© 0000 RAS, MNRAS 000, 000-000 



